31 research outputs found

OxfordVGG Submission to the EGO4D AV Transcription Challenge

Author: Bain Max
Huh Jaesung
Zisserman Andrew
Publication date: 18/07/2023

This report presents the technical details of the OxfordVGG team's submission to the EGO4D Audio-Visual (AV) Automatic Speech Recognition Challenge 2023. We present WhisperX, a system for efficient speech transcription of long-form audio with word-level time alignment, along with two publicly available text normalisers. Our final submission obtained a Word Error Rate (WER) of 56.0% on the challenge test set, ranking 1st on the leaderboard. All baseline code and models are available at https://github.com/m-bain/whisperX. Comment: Technical Report

arXiv.org e-Print Archive

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

Author: Bain Max
Han Tengda
Huh Jaesung
Zisserman Andrew
Publication date: 11/07/2023

Large-scale, weakly-supervised speech recognition models, such as Whisper, have demonstrated impressive results on speech recognition across domains and languages. However, their application to long audio transcription via buffered or sliding window approaches is prone to drifting, hallucination and repetition, and prohibits batched transcription due to their sequential nature. Further, timestamps corresponding to each utterance are prone to inaccuracies, and word-level timestamps are not available out-of-the-box. To overcome these challenges, we present WhisperX, a time-accurate speech recognition system with word-level timestamps utilising voice activity detection and forced phoneme alignment. In doing so, we demonstrate state-of-the-art performance on long-form transcription and word segmentation benchmarks. Additionally, we show that pre-segmenting audio with our proposed VAD Cut & Merge strategy improves transcription quality and enables a twelve-fold transcription speedup via batched inference. Comment: Accepted to INTERSPEECH 202

arXiv.org e-Print Archive

With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition

Author: Damen Dima
Huh Jaesung
Kazakos Vangelis
Nagrani Arsha
Zisserman Andrew
Publication date: 01/11/2021

In egocentric videos, actions occur in quick succession. We capitalise on the action's temporal context and propose a method that learns to attend to surrounding actions in order to improve recognition performance. To incorporate the temporal context, we propose a transformer-based multimodal model that ingests video and audio as input modalities, with an explicit language model providing action sequence context to enhance the predictions. We test our approach on the EPIC-KITCHENS and EGTEA datasets, reporting state-of-the-art performance. Our ablations showcase the advantage of utilising temporal context as well as incorporating the audio input modality and a language model to rescore predictions. Code and models at: https://github.com/ekazakos/MTCN. Comment: Accepted at BMVC 202

arXiv.org e-Print Archive

Oxford University Research Archive

Explore Bristol Research

Spot the conversation: speaker diarisation in the wild

Author: Afouras Triantafyllos
Chung Joon Son
Huh Jaesung
Nagrani Arsha
Zisserman Andrew
Publication venue: 'International Speech Communication Association'
Publication date: 01/01/2020

The goal of this paper is speaker diarisation of videos collected 'in the wild'. We make three key contributions. First, we propose an automatic audio-visual diarisation method for YouTube videos. Our method consists of active speaker detection using audio-visual methods and speaker verification using self-enrolled speaker models. Second, we integrate our method into a semi-automatic dataset creation pipeline which significantly reduces the number of hours required to annotate videos with diarisation labels. Finally, we use this pipeline to create a large-scale diarisation dataset called VoxConverse, collected from 'in the wild' videos, which we will release publicly to the research community. Our dataset consists of overlapping speech, a large and diverse speaker pool, and challenging background conditions. Comment: The dataset will be available for download from http://www.robots.ox.ac.uk/~vgg/data/voxceleb/voxconverse.html . The development set will be released in July 2020, and the test set will be released in October 202

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive

25th annual computational neuroscience meeting: CNS-2016

Author: Abbott L.F.
Abeysuriya Romesh G.
Aertsen Ad
Agnes Everton J.
Ahamed Tosif
Ahmadabadi Majid Nili
Ahn Sora
Aihara Kazuyuki
Andreassen Ole A.
Ardestani Mohammad Hovaidi
Arroyo David
Aton Sara J.
Babichev Andrey
Bachmann Claudia
Badel Laurent
Baek Hyeon-Man
Baek JeongHun
Baek Kwangyeol
Bahuguna Jyotika
Bak Ji Hyun
Baker Chris I.
Bakker Rembrandt
Balaguer‑Ballester Emili
Bard G.
Barnett William H.
Baroni Fabiano
Basnayake Kanishka
Baysal Velt
Bennett Matthew R.
Bernard Christophe
Berry Hugues
Beuth Frederick
Bezgin Gleb
Bill Johannes
Birgolias Justas
Blackwell Justin
Bohnenkamp Lisa
Bojak Ingo
Borisyuk Roman
Bos Hannah
Bradley Samual P.
Breakspear Michael
Breitwieser Oliver
Briaire Jeroen J.
Briggman Kevin L
Brinkman Braden A.
Brown John
Brown Ritchie E.
Brunel Nicolas
Buhry Laure
Buice Michael
Burkitt Anthony N.
Burton Shawn D.
Buttler Simone
Bytschok Ilja
Cantarelli Matteo
Chakravarthy V.Srinivasa
Chan Ho Ka
Chapman Phillip D.
Chatzikalymniou Alexandra Pierri
Chavane Frédéric
Chen Liang
Chen Weiliang
Cheung Chung Ching
Chhabria Karishma
Chintaluri Chaitanya
Choe Yoonsuck
Choi Hannah
Choi Hansol
Choi Ilhwan
Choi Jee Hyun
Choi Woochul
Choi Yun Seo
Choung Oh‑hyeon
Chung SueYeon
Clarke Eric F.
Clements Katie
Cloherty Shaun L.
Clopath Claudia
Cocchi Luca
Cohen Yale E.
Cook Mark
Crook Sharon M.
Cserpán Dorottya
Culmone Viviana
Dabaghian Yuri
Dale Anders M.
Daly Kevin C.
Dasgupta Sakyasingha
Davey Neil
Davison Andrew
de Weerd Peter
Deco Gustavo
Demkó László
Demutz Harald
Denk Cornelia
Destexhe Alain
Devor Anna
DeVuti Justin
Diamond Alan
Diesmann Markus
Dillen Kim
Doya Kenji
Dragoi Valentin
Draguljić Daniel
Drew Jordan
Drysdale Peter M.
Duarte Renato
Dura‑Bernal Salvador
Edwards Andy
Einevoll Gaute T.
Elices Irene
Elnevoll Gaute T.
Ernst Udo A.
Esler Timothy B.
Esposito Elric
Faraji Mohammad Java
Fedorov Leonid A.
Fenk Lisa M.
Ferguson Katie
Ferrario Andrea
Filipovi Marko
Fink Christian G.
Fink Gereon R.
Fishman Yonatan I.
Fornito Alex
Forrow Csaba
Fouquet Coralie
Frangou Sophia
Freestone Dean R.
Frijns Johan H. M.
Fulcher Ben D.
Fung Felix
Gajic N. Alex Cayco
Gallimore Andrew R.
Gallinaro Júlia
Gerkin Richard C.
Gerstner Wulfram
Giaffar Hamza
Giese Martin
Giese Martin A.
Gilson Matthieu
Gips Bart
Gleeson Padraig
Gliske Stephen V.
Glomb Katharina
Goetze Felix
Goldsworthy Mitchell R.
Gollo Leonardo L.
Goncharenko Julia
Goodarzinic Abdorreza
Graham Bruce P.
Grayden David B.
Grewe Jan
Hadrava Michal
Hagen Espen
Halnes Geir
Hamade Khaldoun
Hamker Fred H.
Han Hio-Been
Han Seung Kee
Hansen Mads
Harper Zachary J.
He Hu
Helias Moritz
Hermann Christoph S.
Hilgetag Claus‑Christian
Hines Michael L.
Hlinka Jaroslav
Hof Patrick R.
Holman Katherine A.
Hong Sungho
Hordacre Brenton
Howard Jr. James H.
Huang Guang-Bin
Huang Haiping
Huerta Ramon
Huh Dongsung
Hutt Axel
Hwang Dong‑Uk
Hwang Eunjin
Hye Jr. Eoon
Iannella Nicolangelo
Ibbotson Michael R.
Ionta Silvlo
Ishii Shin
Issa Fadi A.
Iyer Ramakrishnan
Jacobs Heidi
Jang Hyun Jae
Jang Jaeson
Jensen Ole
Jeong Jaeseung
Jeong Jaesung
Jeong Yong
Jirsa Viktor K.
Jo Sumin
Joo Pangyu
Josić Kresimir
Ju Huiwen
Jun Eunji
Jun Sang Beom
Jung Nam
Jung Woo-Sung
Jung Younginha
Kahng B.
Kale Penelope J.
Kalkman Randy K.
Kameneva Tatiana
Kang Jiyoung
Karoly Philippa J.
Kasumi Ohta
Kavalali Enge T.
Kawato Mitsuo
Kazama Hokto
Kedziora David J.
Kekona Tyler
Keller Daniel
Kennedy Henry
Kepple Daniel
Kerr Cliff C.
Kerr Robert R.
Kilpatrick Zachary P.
Kim Ammo J.
Kim Bowon
Kim Chang Sub
Kim DaeEun
Kim Hojeong
Kim Hoon-Hee
Kim Hyoungkyu
Kim Jae Kyoung
Kim Jimin
Kim Jinseop
Kim Juhee
Kim Minjung
Kim Seongkyun
Kim Su Hyun
Kim Sung-Phil
Kim Tae
Kim Taegyo
Kim Won Sup
Kim Youngsoo
Kiser Seth A.
Klanner Felix
Kleberg Florence I.
Klingbeil Guido
Knösche Thomas
Koren Veronika
Kotaleski Jeanette Hellgren
Koulakov Alex
Kralik Jerald D.
Kringelbach Morten L.
Kruscha Alexandra
Kuhlmann Levin
Kukolja Juraj
Kumar Arvind
Kundu Prantik
Kunze Tim
Kuravi Pradeep
Kwag Jeehyun
Kwon Jaehyung
Lai Pik‑Yin
Lakatos Peter
Latorre Roberto
Leahy Will
Lee Changju
Lee Chungho
Lee Dan D.
Lee Do-won
Lee Heonsoo
Lee Hyang Jung
Lee Hyang Woon
Lee Hyeonsu
Lee Jae Woo
Lee Jaejin
Lee Jeungmin
Lee Joonwon
Lee Jung H.
Lee Sang Wan
Lee Sang-Hun
Lee Seungjun
Lee Soohyun
Lee Sue-Hyun
Lee Tae Ho
Lee Won Hee
Lee Yong‑il
Lefebvre Baptiste
Lefebvre Jérémie
Leleu Timothée
Leng Luziwei
Levi Rafael
Levina Anna
Levy Brandon A.
Li Luozheng
Liang Guangsheng
Lidner Benjamin
Liedtke Joscha
Lim Daeseob
Lim Sewoong
Lin Xiahoan
Linder Benjamin
Lines Glenn T.
Lizler Joseph T.
Lochmann Timm
Lowet Eric
Luebke Jennifer
Lytton William W.
Lyu Cheng
Ma Hailin
Maeng Seung Eu
Malmon Gabby
Mandall Alekhya
Maouene M.
Marcelli Angelo
Marin Boris
Markin Sergey
Markram Henry
Marre Olivier
Marsalek Petr
Marsat Gary
Martel Roman
Marucci Lucia
Maturana Matias I.
McCarley Robert W.
McDonnell Mark D.
McKenna James T.
McLauchlan Campbell
Meffin Hamish
Mehta Hima
Meier Karlheinz
Meijas Jorge F.
Mellen Nick
Memmeshei Raol-Martin
Menzies Rosemary J.
Merriosn-Hort Robert
Metzner Christoph
Mi Yuanyuan
Mihalas Stefan
Miller Thomas
Moezzi Bahar
Molkov Yaroslav I.
Moon Jangsup
Moon Seok-hun
Morris Laurel S.
Morrison Abigail
Mosqueiro Thiago S
Mu Shang
Muler Eilif
Muralidharan Vignesh
Murray John D.
Murray Micha M.
Mäki‑Marttunen Tuomo
Neymotin Samuel
Neymotin Samuel A.
Niry Mohammad
Nishikawa Isao
Nolte Max
Nowotny Thomas
Oba Shigeyuki
Obermayer Klaus
Ognjanovski Nicolette
Ouyang Guang
Ozer Mahmut
Paik Se-Bum
Paik Se‑Bum
Palmer S.E.
Palva Matias J.
Paninski Liam
Pariz Aref
Park Chang-hyun
Park Choongseok
Park Hae‑Jeong
Park Ji Sung
Park Memming
Park Sang-Min
Park Sol
Parsi Shervin S.
Parziale Antonio
Pasupathy Anitha
Perotti Luca
Peterson Andre
Petkoski Spase
Petrovici Mihai A.
Petterson Klas H.
Philips Ryan T.
Phillips Ryan S.
Pillow Jonathan
Pittà Maurizio De
Plogmacher Lukas
Podlaski William
Pollonini Luca
Ponce‑Alvarez Adrián
Popp Pamela Osborn
Preuschoff Kerstin
Priesemann Viola
Priyadharsini B. Praga
Psarrou Maria
Quang Le Anh
Quintana Adrian
Ramsey Julia
Ranjan Rajnish
Rankin James
Rasch Malte J.
Rasuli Nader
Ratnadurai‑Giridharan Shivakeshavan
Reig Ramon
Reimann Michael W.
Rennle Chris J.
Reyes Amy
Richter René
Ridding Michael C.
Rieke Fred
Rinberg Dima
Rinzel John
Ritter Petra
Roach James P.
Robb Daniel T.
Roberts Mark J.
Robinson Peter A.
Rodriguez Francisco B.
Rotter Stefan
Rubchinsky Leonid L.
Rubinov Mikail
Rumbell Timothy
Rupp André
Rybak Ilya A.
Ryu Juhyoung
Sadeh Sadra
Saggio Maria L.
Sander Leonard M.
Sanger Terence D.
Sanz-Leon Paula
Sanz‑Leon Paula
Saska Daniel
Schaworonkow Natalie
Schemmel Johannes
Scheutz Matthias
Schiff Steven J.
Schilstra Maria
Schilstra Marla
Schmidt Maximilian
Schmidt Robert
Schottdorf Manual
Schutter Erik De
Schwikard Achim
Seeholzer Alexander
Seidenstein Alexandra
Sejnowski Terrence J.
Sekulić Vladisla
Senatore Rosa
Senk Johanna
Seo Sat Byul
Seung H. Sebastian
Sharpee Tatyana O.
Shea Steven
Shea-Brown Eric
Shea‑Brown Eric
Shen Kelly
Shiau LieJune
Shimazaki Hideaki
Shin Hee‑sup
Shin In-Seob
Shivkumar Sabyasach
Shlizerman Eli
Shomali Safura Rashid
Siep Silvan F.
Silberberg Gilad
Silver Angus
Silver R. Angus
Skiker K.
Skilling Quinton M.
Skinner Frances K.
Smit Daniel
Smith Brian
Smith Jeffrey
Soh Jaehyun
Soman Karthik
Somogyvári Zoltán
Sompolinsky Haim
Song Min
Song Min-Ho
Song Youngjo
Soundry Daniel
Sourina Olga
Spampinato Giulia Lia Beatrice
Spiegler Andreas
Spinney Richard E.
Sprecher Simon
Stacey William C.
Stephens Greg
Stern Merav
Steuber Volker
Steyn-Ross D. Alistair
Steyn-Ross Moira L.
Stimberg Marcel
Strube‑Bloss Martin F.
Stöckel David
Su Jianzhong
Sun Haoqi
Sweeney Yann
Tabas Alejandro
Tahayori Bahman
Takashima Akira
Tam Nicoladie D.
Tamagnini Francesco
Tang Rongxiang
Tang Yi-Yuan
Teka Wondimu
Tetzlaff Tom
Tezuka Taro
Toporikova Natalia
Torres Joaquin J.
Toyoizumi Taro
Tran Patricia H. P.
Trembleau Alain
Triesch Jochen
Trisch Jochen
Tsaneva‑Atanasova Krasimira
Tsuchimoto Yoshiko
Tuomo Maki-Martun
Tveito Aslak
Valizadeh Alireza
van Albada Sacha J
van Albada Sacha J.
van der Eerden Jan
Varona Pablo
Veale Richard
Viriyopase Atthaphon
Vitay Julien
Vogels Rufin
Vogels Tim
Vogels Tim P.
Vogt Simon M.
Voon Valerie
Voronenko Sergej O.
Vuust Peter
Vörös János
Wallentin Mikkel
Wang Dahui
Wang Jisung
Wang Sheng-Ju
Wang Yuzhe
Warburton Julia M.
Weaver Christina M.
Wegener Detlef
Weidel Philipp
Welzig Charles M.
Werdt Stephen Van
Wibral Michael
Wickens Jeffery R.
Widmer Yves
Witek Maria A. G.
Witting Jens
Wolf Fred
Wong Michael
Wu Si
Wu Sl
Wójcik Daniel K.
Xu Zhiheng
Yamada Yasnori
Yamamura Yorkio
Yang Huei-Fang
Yang Xu
Yeon Ji Won
Yger Pierre
Yilmaz Ergin
Yoo Minsu
Yoon Sangsup
Yoshimoto Junichiro
Young-Ah Rho
Yu Suin
Zaho Yuan
Zamora Criseida
Zaptocky Martin
Zhang Mingsha
Zhang Wenhao
Zhao Chang
Zhao Xiaochen
Zhao Xuelong
Zhou Changsong
Zochowski Michal
Zochowski Michal R.
Zouridakis George
Zurowski Bartosz
Publication venue: BMC
Publication date: 01/01/2016

The same neuron may play different functional roles in the neural circuits to which it belongs. For example, neurons in the Tritonia pedal ganglia may participate in variable phases of the swim motor rhythms [1]. While such neuronal functional variability is likely to play a major role in the delivery of the functionality of neural systems, it is difficult to study it in most nervous systems. We work on the pyloric rhythm network of the crustacean stomatogastric ganglion (STG) [2]. Typically, network models of the STG treat neurons of the same functional type as a single model neuron (e.g. PD neurons), assuming the same conductance parameters for these neurons and implying their synchronous firing [3, 4]. However, simultaneous recording of PD neurons shows differences between the timings of spikes of these neurons. This may indicate functional variability of these neurons. Here we modelled separately the two PD neurons of the STG in a multi-neuron model of the pyloric network. Our neuron models comply with known correlations between conductance parameters of ionic currents. Our results reproduce the experimental finding of increasing spike time distance between spikes originating from the two model PD neurons during their synchronised burst phase. The PD neuron with the larger calcium conductance generates its spikes before the other PD neuron. Larger potassium conductance values in the follower neuron imply longer delays between spikes, see Fig. 17. Neuromodulators change the conductance parameters of neurons and maintain the ratios of these parameters [5]. Our results show that such changes may shift the individual contribution of two PD neurons to the PD-phase of the pyloric rhythm, altering their functionality within this rhythm. Our work paves the way towards an accessible experimental and computational framework for the analysis of the mechanisms and impact of functional variability of neurons within the neural circuits to which they belong.

HAL AMU

ScholarWorks@UNIST

Juelich Shared Electronic Resources

Central Archive at the University of Reading

IUPUIScholarWorks

Springer - Publisher Connector

Harvard University - DASH

Heidelberger Dokumentenserver

PubMed Central

Archivio della Ricerca - Università di Salerno

Apollo (Cambridge)

Repository@Napier